Yozgat Province
Make Satire Boring Again: Reducing Stylistic Bias of Satirical Corpus by Utilizing Generative LLMs
Ozturk, Asli Umay, Cekinel, Recep Firat, Karagoz, Pinar
Satire detection is essential for accurately extracting opinions from textual data and combating misinformation online. However, the lack of diverse corpora for satire leads to the problem of stylistic bias which impacts the models' detection performances. This study proposes a debiasing approach for satire detection, focusing on reducing biases in training data by utilizing generative large language models. The approach is evaluated in both cross-domain (irony detection) and cross-lingual (English) settings. Results show that the debiasing method enhances the robustness and generalizability of the models for satire and irony detection tasks in Turkish and English. However, its impact on causal language models, such as Llama-3.1, is limited. Additionally, this work curates and presents the Turkish Satirical News Dataset with detailed human annotations, with case studies on classification, debiasing, and explainability.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Europe (0.14)
- Asia > Middle East > Republic of Türkiye > Ankara Province > Ankara (0.04)
- (4 more...)
- Media > News (1.00)
- Government (1.00)
Spatio-Temporal Anomaly Detection with Graph Networks for Data Quality Monitoring of the Hadron Calorimeter
Asres, Mulugeta Weldezgina, Omlin, Christian Walter, Wang, Long, Yu, David, Parygin, Pavel, Dittmann, Jay, Karapostoli, Georgia, Seidel, Markus, Venditti, Rosamaria, Lambrecht, Luka, Usai, Emanuele, Ahmad, Muhammad, Menendez, Javier Fernandez, Maeshima, Kaori, Collaboration, the CMS-HCAL
The compact muon solenoid (CMS) experiment is a general-purpose detector for high-energy collision at the large hadron collider (LHC) at CERN. It employs an online data quality monitoring (DQM) system to promptly spot and diagnose particle data acquisition problems to avoid data quality loss. In this study, we present semi-supervised spatio-temporal anomaly detection (AD) monitoring for the physics particle reading channels of the hadronic calorimeter (HCAL) of the CMS using three-dimensional digi-occupancy map data of the DQM. We propose the GraphSTAD system, which employs convolutional and graph neural networks to learn local spatial characteristics induced by particles traversing the detector, and global behavior owing to shared backend circuit connections and housing boxes of the channels, respectively. Recurrent neural networks capture the temporal evolution of the extracted spatial features. We have validated the accuracy of the proposed AD system in capturing diverse channel fault types using the LHC Run-2 collision data sets. The GraphSTAD system has achieved production-level accuracy and is being integrated into the CMS core production system--for real-time monitoring of the HCAL. We have also provided a quantitative performance comparison with alternative benchmark models to demonstrate the promising leverage of the presented system.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.28)
- North America > United States > Maryland > Prince George's County > College Park (0.14)
- North America > United States > California > Riverside County > Riverside (0.14)
- (63 more...)